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We study analytically the late time statistics of the number of particles in a growing tree model 
introduced by Aldous and Shields. In this model, a cluster grows in continuous time on a binary 
Cayley tree, starting from the root, by absorbing new particles at the empty perimeter sites at a rate 
proportional to c~ l where c is a positive parameter and I is the distance of the perimeter site from the 
root. For c = 1, this model corresponds to random binary search trees and for c = 2 it corresponds 
to digital search trees in computer science. By introducing a backward Fokker-Planck approach, we 
calculate the mean and the variance of the number of particles at large times and show that the 
variance undergoes a 'phase transition' at a critical value c = v2- While for c > v2 the variance 
is proportional to the mean and the distribution is normal, for c < v2 the variance is anomalously 
large and the distribution is non-Gaussian due to the appearance of extreme fluctuations. The model 
is generalized to one where growth occurs on a tree with m branches and, in this more general case, 
we show that the critical point occurs at c = y/m. 
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I. INTRODUCTION 



Growing clusters are ubiquitous in nature and they exhibit fascinating structures and patterns. Examples range from 
natural fractals, such as snowflakes and soots, to artificial structures such as networks, for example the Internet and 
social networks. Various growth models have been studied extensively by physicists over the last three decades In 
these models growth starts from a single seed site and proceeds via absorbing new particles into the cluster accordingto 
certain specified rules. Different growth rules give rise to different growth models, examples being the Eden model 0], 
invasion percolation diffusion limited aggregation £| and the growing network models [j| which have recently 
received much attention. There are two reasons why many of these growth models are often studied on a Cayley tree 
(or on the Bethe lattice) 0. First, the tree structure of the Bethe lattice mimics a Euclidean lattice in the limit 
of high dimensions where the mean field theory often becomes exact. Secondly, the absence of loops on the Cayley 
tree often allows one to obtain exact analytical solutions which are very difficult to obtain on a regular (i-dimcnsional 
lattice. There is yet another compelling motivation for studying these growth models on a Cayley tree and this comes 
from computer science. 'Storing and Search' of data is a very important area of computer science Incoming data 
to a computer is usually stored on a Cayley tree by using various data storage algorithms and the tree so grown is 
called a 'search tree' Q. Different algorithms lead to different search trees and in some cases, as explained below, the 
rules of growth of a search tree can be shown to be exactly equivalent to a 'physical' growth model on the tree. Thus 
the study of these physical growth models on a Cayley tree provides important insights into data storage in computer 
science. 

As a first example of this equivalence between a physical growth model and a search tree, we show here that the 
Eden model on a binary Cayley tree is exactly equivalent to the random binary search tree (RBST). Consider the 
Eden model on a binary Cayley tree where the growth starts from the root [6j. At the first step, a particle gets 
absorbed at the root, thus forming a cluster of size 1. This cluster has now two empty neighbors which defines the 
perimeter of the cluster. At the next step, a new particle will get absorbed at any of these two perimeter sites chosen 
k> , with equal probability, thus forming a cluster of size 2. The subsequent growth occurs following the same rule, namely 
a new particle gets absorbed at any of the perimeter sites chosen with equal probability. In Fig. (1), we show a cluster 
after 4 steps where the black sites denote the cluster and the shaded sites denote the current perimeter sites that are 
available for subsequent growth. Fig. (2) shows all possible Eden clusters of size 3 and their associated statistical 
weights. 

On the other hand, a binary search tree in computer science is constructed by the following simple algorithm 
Imagine that we have a data string consisting of N items which are labeled by the N integers: {1, 2, . . . , N}. These 
could be the months of the year or the names of people etc. Let us assume that this data appears in a particular 
order, say {6, 4, 5, 8, 9, 1, 2, 10, 3, 7} for N — 10 integers. This data is first stored on a binary tree following the simple 
dynamical rule: the first item 6 is stored at the root of the tree (see Fig. ©)• The next item in the string is 4. We 
compare it with 6 at the root and since 4 < 6, we store 4 in the left daughter node of the root. Had it been bigger 
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FIG. 1: An Eden cluster of size 4 on a Cayley tree. The black sites form the cluster and the shaded sites form the perimeter. 
At the next step, growth can occur at any of the 5 shaded perimeter sites with equal probability 1/5. 
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FIG. 2: All possible Eden clusters of size 3 on a tree and their associated statistical weights w. 

than the root item 6, we would have stored it in the right daughter node. The next item in the string is 5. We again 
start from the root, see that 5 < 6, so we go to the left branch. There we encounter 4 and we find 5 > 4, so we go 
the right daughter node of 4. This process is continued till all the N = 10 items are assigned their nodes and we 
get a unique binary search tree (BST) (see Fig. |JS}) for this particular data string {6, 4, 5, 8, 9, 1, 2, 10, 3, 7}. Usually 
the data arrives at a computer in random order. To study this situation, one considers the simplest model called the 
'random binary search tree' (RBST) model where one assumes that the incoming data string can arrive in any of the 
N\ possible orders or sequences, each with equal probability |9j. For each of these sequences, one has a binary tree. 
For example, in Fig. (4), we show the binary trees for N — 3 along with their associated probabilities. 

Comparing Fig. (2) and Fig. (4), one sees immediately that the Eden trees after 3 steps have exactly the same 
configurations and statistical weights as the random binary search trees with data size N = 3. This analogy can be 
easily extended to all N. The key point is that after (n — 1) steps there are (n — 1) occupied sites in the Eden cluster 
and n perimeter sites (this is easy to understand as the addition of a new occupied site eliminates one old perimeter 
site while creating two new perimeter sites) . The probability of subsequent growth at step n at any of these perimeter 
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FIG. 3: The binary search tree associated with the data string {6, 4, 5, 8, 9, 1, 2, 10, 3, 7}. 
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FIG. 4: All possible random binary search trees for a data of size N = 3 and their associated statistical weights. 



sites is p n — l/n. Thus the statistical weight of a cluster of N sites formed by a specific history of growth is simply 
w = P1P2 ■ ■ - Pn = which is the same as in the RBST model. Thus the Eden model on the Cayley tree is 

exactly equivalent to the RBST. 

Anothe r popula r search tree model is known as the 'digital search tree' (DST) which is constructed by the following 
rule 0, H, llll H^ . IT3I 0, 0. Consider again a binary Cayley tree each node of which can contain at most one 
entry. One starts with an empty tree and the data is stored sequentially. The first data item is stored in the root of 
the tree. The next one arrives at the root and finding it occupied, moves to any of the two empty daughter nodes 
chosen at random and occupies that node. Then the next item arrives and again it starts at the node, chooses any of 
its two daughters randomly and moves there. If the chosen daughter is empty it occupies it. If the chosen daughter is 
already occupied, it again chooses one of its two descendants at random and moves there. Thus at any stage, a new 
entry starts at the root and performs a random walk (to the left or to the right daughter with equal probability) down 
the tree till it finds an empty node and occupies the node. Thus one obtains again a growing tree where at any stage 
growth can occur at any of the perimeter sites, but now the growth probability at a perimeter site a is p a cx 2~ la where 
I a is the distance of the perimeter site from the root. The DST is an important tree structure in computer science and 
has been studied extensively. In particular, it turns out the DST is a natural tree representation [lj, Ua °f the data 
compression algorithm due to Ziv and Lempel |17| . Recently it was shown that a diffusion limited aggregation model 
introduced by Bradley and Strenski fl8l | in physics is exactly equivalent the the DST model in computer science and 
a variety of exact results were obtained by exploiting this connection . 

The examples above illustrate a profound link between growth models and the dynamics of search tree formation in 
computer science. Note that the two search tree models discussed above, the RBST and the DST, can be considered 
as special cases of a general growth model where growth occurs ( i.e. a new particle gets absorbed) at any of the 
available perimeter sites a with a growth probability p a cx c~ la where c is a constant positive parameter and l a is the 
distance of the perimeter site a from the root of the tree. The RBST (equivalently the Eden model) corresponds to 
c = 1 so that all perimeter sites have equal probability to absorb a particle. The DST, on the other hand, corresponds 
to c = 2 as discussed above. It is then useful and interesting to study this general growth model parametrized by 
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c and ask if there are any qualitative changes in the statistical properties of the growth clusters as one varies the 
parameter c continuously. Indeed, Aldous and Shields studied a continuous-time version of this generalized growth 
model ^||- Note that in the two models discussed above time is discrete and is equal to the number of particles 
in the tree. In the version of the model studied by Aldous and Shields, time is considered continuous and growth 
occurs at any of the available perimeter sites say the site a with a rate proportional to c~ la where c is a positive 
parameter. In this continuous-time model, the total number of particles in the tree at time t is thus a random variable, 
unlike in the discrete time version. Thus, while the discrete-time model has a constant particle number ensemble, the 
continuous-time model has a constant time ensemble, much like the canonical and the grand canonical ensemble in 
statistical physics. Asymptotically at long times, both the discrete-time and the continuous- time versions of the model 
are expected behave in a similar fashion. Henceforth, we will consider in this paper only the continuous-time version 
a la Aldous-Shields, since it is, from a technical point of view, easier to study than its discrete-time counterpart. 

The question naturally arises whether the statistical properties of the growing clusters in this model undergo any 
qualitative change of behavior as one tunes the parameter c continuously. Indeed, Aldous and Shields established 
rigorous probabilistic bounds to show that the nature of the fluctuations (variance) in the number of particles in the 
tree at time t is qualitatively different for c < \/2 and c > \ 2. While for c > \[2 the central limit theorem holds and 
the total number of particles has a limiting Gaussian distribution 16], for c < y/2 the central limit theorem breaks 
down due to the appearance of anomalously large fluctuations. Thus, there is a sharp phase transition in the nature 
of the fluctuations at a critical value c = y2. However, the mechanism responsible for this phase transition and even 
the explicit quantitative behavior of the fluctuations above, below, or at the critical point were not easy to obtain 
within the rigorous probabilistic analysis of Aldous and Shields. The principal purpose of this paper is to provide 
a detailed quantitative understanding of this rather 'peculiar' phase transition. Our method, completely different 
from the original approach of Aldous and Shields, employs a backward Fokker-Planck formalism. The advantage of 
this method is that one can obtain exact asymptotic results explicitly. Moreover, our analysis also shows that the 
mathematical mechanism behind this phase transition is similar to the phase transitions found recently in the variance 
of the number of nodes needed to store data on a m-ary search tree (where m is the number of branches) at the critical 
value m = 26 9, 19, 20, 21, 22,£3j,£J and also in the variance of the number of splitting events in a D-dimensional 
fragmentation model at the critical value D c = n/sm^ 1 (l/\/8) = 8.69363 . . . [H|2(j. 

The layout of the paper is as follows. In the next Section (II), we define the model precisely and summarize the 
main results. We study here a generalized Aldous-Shields model where the growth takes place on a Cayley tree with 
m branches. In Section III, we derive the evolution equations for the mean and variance of the number of occupied 
sites as a function of time via a backward Fokker Planck technique. A simple scaling analysis is then carried out 
to determine the temporal growth exponents. In Section IV a more thorough analysis of the evolution equations is 
provided that enables us to obtain explicitly not just the growth exponents, but also exact expressions for various 
amplitudes and prefactors that include interesting log-periodic oscillations. We conclude with a summary and a 
discussion of open questions in the last section. 



We consider a generalized Aldous-Shields model where growth occurs on a Cayley tree (rooted at O) with m 
branches (see Fig. EJ). Aldous and Shields studied only the binary case m = 2. Initially the tree is empty and 
growth occurs in continuous time starting from the root O. At any instant t, one first identifies the available perimeter 
nodes. A node a at time t is a perimeter node if it is empty at t but its parent node is occupied at t (see Fig. [5J ■ 
Subsequently, in a small time interval At, a perimeter node a either absorbs a particle with probability c~ la At or 
remains unoccupied with probability (1 — c~ la At), where l a is the depth of the perimeter node a, i.e. its distance 
from the root O. This growth process occurs simultaneously at all the perimeter nodes. Thus, the total number of 
particles n(t) in the tree rooted at O is clearly not a fixed number at a given time t, instead it is a random variable 
in the sense that the value of n{t) differs from one history of evolution to another. We are interested in computing 
the statistics of n(t) at large times t. 

In this model, we have two parameters m and c. It is useful to first summarize our main results. Using a backward 
Fokker-Planck approach we derive an exact evolution equation for the generating function, 



II. THE MODEL AND THE RESULTS 
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where the angle brackets denote an average over all histories of the evolution process and P(n, t) is the probability 
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FIG. 5: The growth of the Aldous-Shields model on a tree with m branches and rooted at O. The filled circles are occupied 
nodes and the shaded ones are the perimeter nodes where growth can occur subsequently. For example the site marked a is a 
typical perimeter site at a distance l a = 3 from the root O of the tree. 



distribution of n at time t. We show that G(/z, f) evolves via the equation 

dG(ji, t) 



dt 



= -G(»,t)+e-»G m (fi,t/c), (2) 



starting from the initial condition G(fi, 0) = 1. By differentiating G(/i, t) with respect to fi and putting fi = 0, one can 
also derive the evolution equations for all the moments of n(t). The equation J5J is nonlinear and nonlocal in time 
for generic values of c and to, and is thus difficult to solve exactly, except for the c = 1 case when it becomes local. 
However, we were able to compute exactly the asymptotic large time behaviors of the mean and the variance of n(t) 
for arbitrary to and c. Below we present our results for the three different cases c = 1 , c < 1 and c > 1 separately. 

The case c = 1: In this case our model is precisely the continuous-time version of the Eden model. This case c = 1 
is exactly solvable since the evolution equation J2J becomes local in time. We solved for G(/x, t) and obtained the 
following explicit result for the distribution P{n, t) for all to and t 

P(n.l)= . V ^ m ~V e -t 1 _ e - {m -l)t 

r(^i)r(n+i) L J 

where r(x) is the standard Gamma function. The mean number of particles M(t) = (n{t)) increases exponentially in 
time for all m > 1, 

M(t) = — V — [exp ((m - l)t) - 1] . (4) 

For the special case m = 1 (a line with a constant rate of deposition), M(t) = t and the distribution P(n, t) — e - *t™/n!, 
obtained from Eq. @ by taking the limit m — > 1, is purely Poissonian as expected. 

The case c < I: Since the growth rate at a perimeter node a is proportional to c~ la where l a is the distance of the 
node from the root O, it is clear that for c < 1, farther a perimeter node is from the root, the larger is its probability 
to get occupied. Thus the cluster grows in a rather ramified manner where long branches grow faster than the short 
branches. In this case we expect that the mean number of sites grows at least exponentially. But since physically this 
case is of little interest, we do not discuss it further in this paper. 



The case c > 1: We now come to the physically most relevant case c > 1. We show that in this case there is a 
sharp phase transition in the asymptotic statistics of n(t) across the critical line c = \pm in the (to, c) plane with 
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FIG. 6: The phase transition as a function of a — ln(m)/ln(c) > 0. The critical point at a c — 2 separates the phase (a < 2) 
with normal fluctuations from the phase (a > 2) with anomalously large fluctuations. 

rn > 1 and c > 1. We calculated exactly the asymptotic time dependence of the mean M(t) = (n(t)) and the variance 
V(t) — (n 2 (t)) — M 2 (t), for all values of m > 1 and c > 1. We show that while the fluctuations are normal (i.e. 
the variance of n{t) is proportional to its mean) for c > s/rn, they are anomalously large for c < s/rn. Even though 
we have two parameters c and m, it turns out that the asymptotic behaviors can be described in terms of the single 
growth parameter 

ln(c) 

where a > since c > 1. In terms of a, the phase transition takes phase at the critical value a c = 2. The normal 
phase for c > y/m corresponds to a < a c = 2 and the anomalous phase for c < y/m corresponds to a > a c — 2. 
More precisely, we find that for large t, the mean M{t) grows as a power law (up to corrections periodic in ln(t)), 

M(t)~At a , (6) 

and we provide an explicit expression for the amplitude A. The variance, on the other hand, has different behaviors 
for c < yfm and c > y/m or equivalently for a > 2 and a < 2. We show that the variance at large times t, again up 
to log-periodic corrections, grows as 

V(t) ~ B 1 t a for a < 2 (7) 
~ B c t 2 ln(f) for a = a c = 2 (8) 
~ Bt 2a - 2 for a > 2. (9) 

We also provide exact expressions for the amplitudes B' ', B c and In order to use these continuous-time results for 
the discrete-time model where the 'time' is same as the number of particles, it is instructive to eliminate the explicit 
t dependence in the results for the variance and instead express it as a function of the mean number of particles. 
Eliminating t between Eqs. ©, @, © and ©, we get 

V(t) - C M{t) for a < 2 (10) 

- C c M(t)ln(M(t)) for a = a c = 2 (11) 

- CM(t) 2 ~ 2/a for a > 2. (12) 

Explicit expressions for the amplitudes C, C c and C are likewise provided. 

We thus see that for a < 2 the fluctuations of n(t) about its mean value, denoted by An(t) = \/V(fj, are of order 
M 1 / 2 as is the case for a normal Gaussian or Poisson distribution. However for a > 2 we find that An(t) ~ Af 
for large M. For a > 2, we have that 1 — 1/a > 1/2 and hence the relative fluctuations about the mean become 
larger as we cross the threshold a = 2 from below. The phase a < 2, or equivalently c > y/m corresponds to a region 
of slower growth where the central limit theorem holds and the distribution of n(t) is asymptotically normal. On the 
other hand, a > 2 marks a phase where rapid growth tends to occur along a single branch resulting in anomalously 
large fluctuations. Thus the statistics of n(t) in this phase is dominated by extreme fluctuations. The nature of this 
phase transition is thus very similar to the ones recently reported in m-ary search trees [9j, ll!j, |2JJ, |2l|, |22j, 1 2 ■' jl 12 1| and 
a related fragmentation model 0, |2(j ■ 

We end this section with a remark on the usage of the term 'phase transition'. The 'phase transition' observed 
in this model refers to the abrupt change of the variance (and also that of the full distribution) of the number of 
particles n(t) in the tree as one changes the parameter a — ln(m)/ln(c) through its critical value a c = 2. This may 
not correspond to the traditional definition of 'phase transition' used in equilibrium statistical mechanics, e.g. the 
divergence of a correlation length as one approaches a critical point as in second order phase transition. The 'phase 
transition' in the Aldous-Shields model is closer to the change of behavior one observes in the diffusion of a Levy 
walker. A Levy walker jumps, at each step, by a random length I drawn from a power law distribution, p(l) ~ 
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for large I with 7 > 1 (required for normalization). It is well known[25j that the root mean square displacement of the 
particle after n steps U ~ n 1 / 2 for large n only when 7 > 2, i.e. one gets normal diffusion and the asymptotic 
position of the walker is distributed normally. On the other hand, for 1 < 7 < 2 one gets anomalously large diffusion, 
yj (-R 2 ) ~ n 1 /' 1 for large n and the asymptotic distribution of the position of the walker is non-Gaussian. Thus there 
is a change of behavior at the critical value j c = 2. The change of behavior in the variance of the number of particles 
in the Aldous-Shields model at the critical value a c — 2 is thus similar in nature to the the change of behavior seen 
for the Levy diffusion at j c = 2, rather than the standard 'phase transition' observed in critical phenomena. 



III. DERIVATION OF THE EVOLUTION EQUATIONS 



In this section we will derive the evolution equation for the probability distribution, and in particular the evolution 
equations for the mean and the variance, of the total number of occupied sites n(t) at time t in a tree rooted at the 
site O. The root O, by definition, has level or depth Iq = 0. The method of derivation is based on a backward Fokker 
Planck formalism which involves considering the future evolution of n(t) conditioned on what happens in the first 
infinitesimal time interval (0, At). Since our final aim is to derive a recursion relation for the evolution process, it 
is convenient to first derive the evolution equation for the number of particles n a (t) in a subtree rooted, say, at any 
arbitrary site a. By definition, n a (t) includes the particle at the root a. The number of particles in the full tree is just 
a special case when the site a is chosen to be the original root O of the full tree. We now count the 'local' time t (for 
this subtree) from the instant the site a becomes a potential growth site, so that, by definition, n o (0) = 0. Clearly, 
at any given t, the distribution P(n a ,t) of n a (t) depends only on t and l a , the depth of the site a. This means that 
one can write 



(F(n a (t))) = V F(n a )P(n a ,t) = f(t;l a ) 



(13) 



n„=0 



where F(x) is any arbitrary function. 

Consider now the site a with its descendants ax, a-i, ■ ■ ■ ,a m , where a is a potential growth site at t = 0. By 
definition a is unoccupied at t = and thus ax, a-i, ■ ■ ■ ,a m are not potential growth sites at t = 0. In the first 
infinitesimal time interval (0, At) there are two possibilities: (i) either no particle fills the potential growth site a, thus 
n a (t) = n a (t — At). This happens with probability 1 — c~ la At. (ii) the other possibility is that the potential growth 
site a is filled by a particle with probability c~ la At and as a consequence the number of particles in the subtree rooted 
at a is increased by one and the daughter nodes ax, 02, • • • , a rn all become potential growth sites. Mathematically we 
can write the above evolution in the following way: in the time interval (0, At) 



i a (t) =n a (t- At)(l-i) + 



(14) 



where / is a random variable which takes the value 1 with probability c~ la At and with probability 1 — c~ la At. 
Taking the expectation of Eq. 114fl with respect to I and the subsequent growth process in the remaining time t — At 
we obtain, upon taking the limit At — > 0, 



5 <n (*)> = c-'- 



1-K(i)>+X> a< (t)> 



i=l 



(15) 



We now use the property that the statistics of the number of particles in a subtree rooted at level a depends only 
on l a as encoded in Eq. (|13fl to obtain 



—M{t; l a ) = c~ l « [1 - M (t; l a ) + mM (t; l a + 1)] 
at 



(16) 



where we have defined 



M(t;l a ) = (n„(t)) 



(17) 



and have used the fact that by definition l ai = l a + 1 if is a daughter of the node a. Note that the root O of the 
full tree has depth lo = 0. Thus the mean M(t) = (n(t)) of the total number of particles in the full tree rooted at O 
is simply, M(t) = M(t, 0). Thus to obtain M(t), our strategy is to find the solution M{t, l a ) of Eq. I|16|) for arbitrary 
l a and eventually put l a = 0. 
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The next step is to notice that as one goes down a level in the tree the growth rate is reduced by a factor of c which 
amounts to rescaling time by a factor of 1/c. In the notation of Eq. I|13|) this means that one may write 

f(t;l a + l)=f(^l a Y (18) 

Next we put l a = in Eq. (|16f) . use the definition M(t;lo) = M(t;0) = M(t) and also the scaling property in Eq. 
(|T%|| to obtain, 

4-MYi) = 1 - M(t) + mM ( - ) . (19) 
at \cj 

This equation, supplemented by the boundary condition M(0) = 0, then describes the evolution of the mean number 
of occupied sites. 

An equation for the variance of n(t) can be derived in a similar fashion. The starting point is obtained by squaring 
the stochastic evolution equation l|14|) and then taking the expectation over / and the evolution in the remaining time 
t - At. This yields 



(nl(t)) = {nl(t-At))(l-c- l «At) + 

We now use the fact that the subtrees rooted at sites at the same level are statistically independent and so 

(n at {t)n aj (i)) = (n ai (t)) (n . (*)) for i £ j. (21) 
Now defining the variance of the number of sites occupied in the tree rooted at as 

V(t) = {n 2 (t))-(n(t)) 2 (22) 
and using the scaling relation Eq. I|18|) . after some elementary algebra, we obtain 

J t V(t)=(±M(t)) 2 -V( t)+ rnv(^. (23) 

The boundary condition for this equation is clearly V(0) = 0. 

Another way to obtain the equations for M and V is by deriving directly an evolution equation for the generating 
function G(/j,,t) of n(t) defined as in Eq. 0}. Following exactly the same backward Fokker-Planck strategy as used 
for the mean, it is straightforward to show that G(/x,i) evolves by the nonlinear nonlocal equation (J2J). The moment 
equations, and equations for the higher moments, Eq. (|19fl and Eq. I|23[) can be obtained by differentiating Eq. @ 
with respect to /i the appropriate number of times and setting fj, = at the end. 

The evolution equation @ is difficult to solve explicitly for generic values of c and m since it is a nonlinear (for 
m 7^ 1) and nonlocal (for c ^ 1) equation. However, exact results can be derived in a few cases that we consider 
below. The asymptotic solution for the mean and variance for generic m and c will be presented later in the next 
section. 
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At) 
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(20) 



A. Exact Solution for the Eden Growth c = 1 



The case c = 1 corresponds to the Eden model where growth occurs at any of the available perimeter sites with 
equal rates. For c = 1, Eq. becomes local in time t and can be explicitly solved. We find that for all m > 1 



GGM) 



1 



MM g-(™- 1 )* 



-l/(m-l) 



(24) 



Expanding the r.h.s. of Eq. (|24ll in powers of as in Eq. (Q, one can then read off the distribution P(n,t) 
explicitly as in Eq. © . The mean number of particles grows exponentially for all m > 1 as in Eq. (0} . Similarly, one 
can compute the variance V(t). We find 



V{t) 



1 e (m-l)t ^ e (m-Dt_^ . 



(m — 1) 



(25) 
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For a fixed m, if one takes the limit of large t and large n keeping the product ne ( m fixed in Eq. 10, one finds 
an asymptotic distribution 

(26) 



Thus, to the leading order, the distribution P(n, t) decays exponentially for large n over a characteristic size n* ~ 
e (m-i)t fojgfc grows exponentially with time t. Interestingly, the distribution has a sub-leading power law tail (in 
addition to the leading exponential tail) where the exponent <j>= (to — 2)/(m— 1) depends continuously on m. 

For the special case m = 1, where we have just a line of sites and the particles arrive at an empty available site at 
a constant rate 1, we get from Eq. Q), M(t) — t. The full distribution, from Eq. ©, becomes a Poisson distribution 
P(n,t) — e~ l t n jn\ as expected. 



P(n,t) 



i 

(m-l) 



n -(m-2)/(m-l) exp 



B. Exact Solution for the Digital Search Tree Growth c = m 

The case c = m corresponds to the case where particles arrive at a constant rate at the root O and then each 
carries out a random walk down the tree until it finds a free site to occupy. During its downward journey in the 
tree the particle, after arriving at any occupied site, chooses one of its m descendants at random. This is precisely 
the algorithm for constructing a m-axy digital search tree 0, El El • If the rate at which the particles arrive at the 
root O is one then the total number of particles in the tree at time t, n(i), is clearly a random variable with Poisson 
distribution 

P(n = M) = ^=^, (27) 

where k = 0, 1, 2 • • • is a positive integer. This yields 

M{t) = t; V(t) = t, (28) 

which we see immediately are the solutions to Eq ()19f) and Eq. ll'lil . Furthermore we see that the generating function 
G(fi, t) for a Poisson distribution is given by 

G(t,fi) = exp(-i + texp(-|u)) . (29) 

It is easy to check that indeed this solves Eq. (0) in the case in = c. 



C. A self-consistent scaling approach for the leading asymptotic growth of the mean and the variance for 

c > 1 and m > 1 



The late time asymptotic behavior of Eq. (|19|l and Eq. I|23() for c > 1 and m > 1 may be deduced quite simply by 
making a self-consistent ansatz for the late time behavior of M and V. First consider Eq. (|19|l . We make the ansatz 

M(t)~At a . (30) 

Substituting this into Eq. (|19|l we may neglect the derivative term on the l.h.s. and assuming that a > (i.e. c > 1), 
matching the coefficients of t a gives 

--1 = 0, (31) 

c a 



which yields 

ln(m) 



(32) 



For non-trivial tree structures we are always in the situation where m > 2 and for the above solution to make sense 
we require that c > 1 to have a positive exponent a. While this simple minded scaling approach yields the correct 
power law growth of M(t) ~ At a for c > 1, it does not provide us the value of the amplitude A. To derive an exact 
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expression for A, we need to solve the full nonlocal equation l|19fl at late times, and this will be carried out in the 
next section. 

Let us make a similar power law ansatz for the late time behavior of V(t) 

V^cuBt 13 . (33) 
Substituting this into Eq. I123II and neglecting the derivative term we obtain 

- Bt + Bm-s + A 2 a 2 t 2a ~ 2 = 0. (34) 

cr 

Asymptotically there are two ways to satisfy this equation. First if we assume a priori that (3 > 2a — 2 then the first 
two terms in Eq. (|34|l must cancel leading to — m, i.e. (3 — ln(m)/ln(c) = a. The a posteriori condition that this 
solution is valid is thus a > 2a — 2, which means a < 2. The second possibility is that all three terms contribute and 
thus (3 — 2a — 2. In this case we find that 

A 2 a 2 A 2 a 2 

B =1-^ = I^' (35) 

and in obtaining the last equality in Eq. I|35|) we have used m = c a . However for this solution to make sense we must 
have that B > because V(t) is clearly positive, consequently Eq. I|35[) can only hold when a > 2 (since c > 1). 

This simple minded scaling approach thus indicates that there is a phase transition in the late time behavior of the 
variance V (t) at the critical parameter value a c = 2. For a < 2, we have V (t) ~ B' t a where the amplitude B' can not 
be determined by the scaling approach. On the other hand, for a > 2 the scaling approach indicates V(t) ~ Bt 2a ~ 2 
and moreover it provides a relationship between the amplitudes B and A (of the mean) via Eq. I|35|l . The critical 
point a c = 2 thus separates the region of normal growth a < 2 (or equivalently c > \fm) 1 where V{t) ~ M(t), from 
the the region a > 2 (i.e. c < \fm) where the variance grows anomalously faster V(t) ~ [M(t)] 2 ~ 2 / a . In the next 
section, we will see that the analysis of the full nonlocal equations (|19fl and l|23|) indeed corroborates theses scaling 
results, and in addition produces exact expressions for all the amplitudes. 

Before proceeding to the full analysis of Eqs. I|19|l and (|23|l in the next section for generic c > 1 and m, it is 
instructive to note that analytic progress is also possible for Eq. (jT§|l in the case where a = ln(m) / ln(c) is a positive 
integer. This includes, in particular, the critical point a — 2. We make the following ansatz 

oo 

M(t) = J2 b nt n , (36) 

where the term k = in the above sum is omitted in order to respect the initial condition M(0) = 0. Matching 
powers of t on substituting this ansatz into Eq. (|19fl yields 

bi = 1 

b k+1 = 6 fc _( ? -l) for k>l. (37) 

We thus see that if there exists a positive integer k* such that a = ln(m)/ln(c) = k* then bk = for all fc > fc* = a 
and we have found the solution to Eq. (fTT)|l in these cases. At late times the leading order behavior is thus dominated 
by the term containing t a and we get 

= ^(c a - 1 -l)(c a - 2 -l)---(c-l). (38) 



a 

In particular, at the critical point a = 2, we get for large t 

M(t) ~ t 2 (39) 

Thus, at this special point a — 2, we have even managed to compute the amplitude A — (c — l)/2 of the mean 
M(t) ~ At 2 exactly. In the case a > 2, the behavior of the variance V(t) now follows immediately from Eq. I|35|l . 
Finally, exactly at the critical point a = 2, we may asymptotically solve Eq. Q23JI with the ansatz V — Bt 2 \n(t) to 
yield 

ln(c) ln(c) ' ^ 



where in the last line of Eq. I|40|) we have used A = (c — l)/2. 
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IV. GENERAL SOLUTION OF THE EVOLUTION EQUATIONS OF THE MEAN AND THE 

VARIANCE 

The full solutions to the nonlocal and nonlinear differential equations of the type in Eqs. (|19I23(I are rather difficult 
to obtain completely. Here we obtain the exact asymptotic solutions following an approach similar to the one used 
by Flajolet and Richmond 13] in solving a class of difference-differential equations arising in the context of digital 
search trees. 

A. Solution for the mean M(t) 

We start by the analysis of Eq. (|19|) assuming c > 1 and m > I. Taking the Laplace transform of Eq. I|19|l we 
obtain 

sM(s) = - - M(s) + mcM(cs) 7 (41) 
s 

where 

/>OG 

M(s) = dt exp(-st)M(t) (42) 
Jo 

and we have used the initial condition M(0) = 0. The above may be written as 

s(s + l) (s + 1) 

Now as M(s) should go to zero as s — > oo and we are considering the case c > 1, we solve Eq. (|43[1 by iteration finding 

1 i 
M( s ) = -Y- — s — . (44) 

Note that taking the limit s — > is not straightforward in Eq. 144|l . This is because if we set s = in the sum on the 
r.h.s of Eq. (|44|l . the sum diverges since m > 1. Following 13] we introduce the function 



1=0 

Thus Q(s/c) = (1 + s/c)(l + s/c 2 )(l + s/c 3 ) .... On the other hand, 



Q( C 3 S ) = J|(l + d>- l s) = (1 + c J 's)(l + cJ^s) ■■■{! + cs)(l + s)(l + s/c)(l + s/c 2 ) . . . 

= (l + s)(l + cs)(l + c 2 s)---(l + c> S )Q( S /c). (46) 
Thus, one can rewrite the product (1 + s)(l + cs) ■ ■ ■ (1 + e's) = Q(c : 's)/Q(s/c). Using this in Eq. (|4*4*}> we get 

M(s) = H(a), (47) 

s 

where 

The next step is to take the Mellin transform of H(s) denned as 

H*(x) = / ds s x - 1 H(s). (49) 
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Substituting H(s) from Eq. lj4*8|l in the definition in Eq. l{4*5|) we get 



H*(x) 



3=0 



Q(ti>a) 



ds 



3=0 



Q{<T) 



da 



h*{x) 



mc 



where 



h*(x) 



da 



Q(?) 



(50) 



(51) 



and in evaluating the sum over j we have assumed Re(mc _I < 1) or equivalently Re(ir) > ln(m)/ln(c) = a. We also 
notice that h*(x) has no poles for Re(cc) > 0, and that the poles of 1/(1 — mc~ x ) are at Xk = a. + 27rifc/ln(c) where 
k = 0, ±1, ±2, . . . runs over all integers. All the poles of H*(x) are thus to the left of the line Re(x) = a. 
The inversion formula for the Mcllin transform is given by 



H(s) = — r / dx H 



(x)s- x , 



(52) 



where the above limits denote an integration up the imaginary axis to the right of all the poles of H* , therefore we 
chose limits with d > a. The contour may be closed in the left half plane (we assume that the integrand vanishes in 
the region Re(x) — > — oo) and we can thus evaluate the inverse Mellin transform in terms of the residues of the poles 
to the left of Re(x) < a, i.e 



H{s) = Res 

poles 



h*{x)s- x 



1 — exp (ln(m) — x ln(c)) 



(53) 



where Res denotes the residue at the pole in question. 

The large time behavior of M(t) is determined by the small s behavior of M(s). Now at small s the dominant 
behavior clearly comes from the poles Xk = a + 2mk/hi(c) running up the imaginary axis, any pole coming to the 
left of this line of poles will be higher order in s. We evaluate the residues in Eq. H53JI . substitute the resulting H(s) 
in Eq. I|47|l and then take the limit s — > to obtain the following asymptotic result 



M(s) 

where we have used the fact Q(0) 
h*(a) - 



- 1 ln(c) 



h*(a) + J2 h *( a + 2nik/\n(c))s^ 

fe#0 



1. Note that from Eq. lf5*T|) and Eq. ljl5fr. we have 

a 01 - 1 da 7T 1 - c°- k 



(l + a)(l + a/c)(l + a/c 2 ). 



sin(7ra) 



iit 

fc=i 



(54) 



(55) 



where the last equality follows from an identity due to Ramanujan 26]. Note that this identity explicitly shows that 
the function h*(x) has simple poles at the negative integers and zero but no poles for Re(x) > as was stated before. 

To extract the leading asymptotic behavior of M(i) for large t, let us first divide the Laplace transform M(s) into 
two parts, M(s) = M p (s) + Mi(s) where M p (s) denote the first term on the r.h.s. of Eq. (|54() and M;(s) corresponds 
to the remaining sum over k ^ 0. Subsequently the inverse Laplace transform M(t) = M p (t) + Mi(t) can also be 
divided into two parts. The term M p (s) has a pure algebraic form, thus its inverse M p (t) has a pure power law growth, 



M p {t) - A t a , 



(56) 



where the constant A can be evaluated as follows. If M p (t) has the form in Eq. (|56|l . its Laplace transform is 
M p (s) — AT(1 + a) s - ( 1+Q ). Comparing this with the first term in Eq. (|54|l gives 



A : 



h*(a) 



ln(c)L(l + a) ln(m)L(a) sin(7ra) 



iit 

fc=i 



(57) 
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where we have used T(l + a) = aT(a), the definition a — ln(m)/ln(c) and the explicit form of h*(a) from Eq. i|55[l . 
Here we note that when a is an integer, it can be verified that Eq. (|57|l agrees with Eq. (|38|) derived for discrete 
values of a in the previous section. 

The second contribution to M(s), M/(s) is given by a Fourier series in ln(s). The inverse Laplace transform of this 
term is difficult to obtain fully but it is easy to see that it gives rise to a late time behavior of the form 

M^t)- At a g(\n(t)) (58) 

where g(x) is a periodic function of a;. The final asymptotic result for large t is thus 

M(t) = M p (t) + Mi(t) ~ At a [1 + g (ln(<))] . (59) 

This exact result thus not only confirms the dominant power-law scaling predicted in section (III) up to log-periodic 
oscillations, but also provides an explicit formula for the amplitude A as in Eq. Q57JI. For example, let us consider 
the binary case m = 2. For the case, when c = 1, the formula in Eq. 1)57(1 gives A = 1, thus M(t) ~ t for large t. On 
the other hand, for m = 2, when c = \/2 (the critical point), one can show from Eq. (|57[1 that A = (\/2 — l)/2 and 
M(t) ~ (y/2 - l)t 2 /2 for large t. 

B. Solution for the variance V(t) 

We now examine the asymptotic behavior of the variance V(t) for large t using a similar formalism. The evolution 
equation l(23|l for the variance V(t) is similar to that for the mean M{t) in Eq. I(19|l except that the source term in 
Eq. (|23|) is (dM/dt) 2 , different from the source term 1 in Eq. I|19|) . Solution of Eq. I|23|) thus requires an explicit 
knowledge of how M(t) behaves with time. Taking the Laplace transform, V(s) — J °° V(t)e~ st dt in Eq. lIl'Mli and 
using V(0) = we obtain 



where 



Rearranging Eq. (jfJU|) gives 



which can be iterated to yield 



?V(s) = S(s) - V(s) + mcV(cs), (60) 



•S'(.s-) : ^ d tex V (-st) (^p) . (61) 



- = + rnc - 

1 + s 1 + s 



V(s)=Yti w-, (m f' -rS{c> S ). (63) 

P) ( 1 + s )( 1 + cs ) •••(! + c J s) 

Using m — c a and the function Q(u) defined in Eq. I)45|l we can rewrite Eq. (|63|l as 

V{ S )=Q(s/c)H 1 {s) (64) 

where 

*<•)-§ (■="•)' m 

The next step is to take the Mellin transform Hl{x) = / °° Hi(s)s x ~ 1 ds of Eq. I|65|l which gives, after a change of 
variable c J s —> s in the integration 

H* (x) = g (^+— Y £ ^f-'ds. (66) 
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Let us first assume that the integral 



h\(x) 



S{S) ^ds 



o Q(s) 



(67) 



exists (the conditions for which will be stated later). Then, for Re(x) > 1 + a, the geometric sum in Eq. ((66(1 converges 
(since c > 1) and we get 



Ht{x) 



h\{x) 



\ — c l+a— x ' 



Inverting this Mellin transform we get 



Hl ( 8 ) = — f°° +d K [ X J s- x dx = Y Res 

9td / , 1 — r l+ct-x 



h\{x) 



(68) 



(69) 



where the poles are at Xk = 1 + a — 2irik /ln(c) with fc = 0, ±1,±2, .... In Eq. 1(69(1 the integration is along the 
imaginary axis to the right of all the poles and then we close the contour over the left half plane. Evaluating the 
residues and substituting the results in Eq. I|64|) we get 



V(s) 



Q(s/c) 

,1+a 



ln(c) 



hl(l + a)+^2ht(l + a- 2irik/ ln(c)) 



(70) 



where h\{x) is given by Eq. (|67|l assuming that it exists. 

We now need to invert the Laplace transform in Eq. H7U|) to evaluate V(t). For large t, as usual, the dominant 
contribution will come from the small s behavior of V(s). Using Q(0) — 1 and assuming h*(X + a — 2mk/\n(c)) exists 
for all k = 0, ±1, ±2 . . ., it is clear from the small s behavior of V(s) in Eq. I|7U|) that for large t 



V{t)~B't a [l + G(ln(t))], 
where G{x) is a periodic function in x and the amplitude B' can be read off as 

t ^ , . [°° S(s) 



B' = 



r(l + a)ln(c) 



; where h*(l + a) 



Q(«) 



Ms. 



(71) 



(72) 



Having obtained the results in Eqs. (|71|l and 1)72(1 . we need to investigate when they are valid. These results are 
valid as long as the integral ft,*(l + a) in Eq. (|72|l exists. The existence of this integral depends on the small s behavior 
of the source function S(s) defined in Eq. H61jl. Using the asymptotic behavior of M(t) from Eq. (|59")l we find that 
for large t 



dMY 
~dT) 



A 2 a 2 t 2a ~ 2 [l+ 5l (ln(t)], 



(73) 



where g\{x) is a periodic function in x. Substituting this large t behavior of (dM/dt) 2 in Eq. (|61|l . it follows that, in 
the case a < 1/2, the integral converges to a nonzero constant as s — > 0. On the other hand, for a > 1/2, the integral 
diverges as S(s) ~ A 2 a 2 T(2a — l^^ 2 "" 1 ) as s — > 0. Up to the log-periodic oscillations, the leading behavior of S(s) 
for small s can be summarized as follows 



S(s) 



Cqs-^-V for a>l/2 
-ln(s) for a = 1/2 
A 1 for a < 1 j2 



where 



Co = A 2 a 2 T{2a - 1) 



(74) 
(75) 
(76) 



(77) 



is a positive constant for a > 1/2. Also, A\ = J °° (dM / dt) 2 dt is a constant that depends on the full form of M(t) and 
not just on its asymptotic behavior since for a < 1/2 the integral is convergent. Substituting this small s behavior 
of S(s) into the integral giving h\(l + a) in Eq. (|72|l and using Q(0) = 1, it is clear that the integral exists (no 
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divergence from the small s limit) only for a < 2. For a > 2, the integral does not exist since the integrand for small 
s scales as s 1- ". Thus the results in Eqs. (|71|) and 172|) hold only for a < 2. 

For a > 2, the above analysis breaks down and we need to employ a different method. We now go back to our 
starting equations l|64|) and l|65|) . It turns out that for a > 2, we can actually extract the leading small s behavior 
directly from these two equations. We directly substitute in Eq. 116511 the leading small s behavior of S(s) ~ Cos~ ( - 2q ~ 1 - ) 
from Eq. J2U) where C = A 2 a 2 T(2a - 1). Additionally we use Q(0) = 1. Eqs. and then yield in the s — > 
limit 

v(s) - E (c 2 - a Y = c i- a) (78) 

3=0 K 1 

where we have used a > 2 which ensures that the sum in Eq. (|78|l is convergent. Inverting the Laplace transform, we 
then get the large t behavior of V(t) for a > 2 

2 A 2 

V{t)^Bt 2a - 2 - where B = ^ g , (79) 

where the constant A is given in Eq. (|57p. Note that this result in Eq. 17911 for a > 2 is in perfect agreement with 
the self-consistent scaling approach used in Section-Ill. 

At the critical point a = 2, the analysis is more delicate. However, from the scaling approach of Section-Ill, we 
already know that for large t, V(t) ~ B c t 2 \n(t) with B c = (c — l) 2 /ln(c) as in Eq. (|40|l . Thus, the asymptotic 
behavior of the variance V(t) can be summarized as 

V(t) ~ B' t a for a < 2 (80) 
~ B c t 2 ln(t) for a = a c = 2 (81) 
~ St 2 "- 2 for a > 2, (82) 



where the three amplitudes are given by 



b' = _ \, „ r |H s «d s 



r(l + a)ln(c) y Q(a) 
B c = (c-l) 2 /ln(c) 

n- 2 4 2 

B - < 83 > 

where A is given in Eq. 1)57(1. Note that computing the amplitude -B' explicitly requires an integration over the full 
source function S(s) which is not so easy. Eliminating the time t between M(t) ~ At a and V(t) in Eq. (|82|) . one can 
express the variance V as a function of the mean M for large M as in Eqs. (|10fl , 1|11|) and i|12|) and one can read off 
the constants C", C c and C in terms of B' , i? c and i? and the amplitude A of the mean given in Eq. I|57|) . 

Let us end this section with a remark on the mathematical mechanism responsible for the phase transition in the 
variance of the number of particles in this Aldous-Shiclds model. We note that the exact evolution equations 119|) 
and (|23|l respectively for the mean and the variance are very similar — they are both linear and nonlocal in time, the 
only difference is in the source term. For the mean M(t) in Eq. (|19|l . the source term is a constant 1 (the first term 
on the r.h.s of Eq. (|19fl ). On the other hand, for the variance, the source term (dM/dt) 2 in Eq. (|23|l depends on 
the evolution of the mean. Thus, the mean feeds into the variance equation as an external source term leading to 
a competition between the growth induced by this external source term and the growth induced internally by the 
remaining two terms on the r.h.s of Eq. (|23|l . This competition between the external and the internal source is finally 
responsible for the phase transition in the asymptotic growth of V(t). For a < 2, the internal source term wins out 
and the variance grows similarly as the mean, V(t) ~ M(t) ~ t a , leading to the normal phase. On the other hand, 
for a > 2, the external source term wins out leading to a faster growth V(t) ~ t 2a ~ 2 ~ [M{t)] 2 ~ 2 / a characterizing 
anomalously large fluctuations. We note that a similar mechanism namely a " competition between the internal source 
and the external driving" was shown to be responsible for phase transitions in fluctuations in a class of fragmentation 
problems studied recently [9j, |2fJ. In these fragmentation problems, it was shown that the mean and the variance 
evolved via similar looking equations [fj |2(j, except that they differed in their respective source terms — the variance 
equation had a source term driven by the mean, in much the same way as in the Aldous- Shields model discussed here. 
Thus, it seems that this phase transition in fluctuations is quite generic as it occurs in a large class of problems and 
the mathematical mechanism responsible for it is as identified above. 
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V. SUMMARY AND CONCLUSION 



In this paper we have studied analytically a growing tree model introduced by Aldous and Shields. In this model, 
growth occurs in continuous time. One starts at t = with an empty Cayley tree with m branches rooted at and 
the tree grows, starting from the root site, by absorbing particles in continuous time. Each site can occupy at most 
one particle. At a given instant t, growth can occur only at the perimeter sites with a rate c~ l where c is positive 
parameter and I is the distance of the perimeter site from the root of the tree. For c — 1 this model is isomorphic to 
a continuous-time Eden model on a tree and also corresponds to the random binary search tree problem in computer 
science. For c = 2 this model corresponds to the digital search tree problem in computer science. 

We have introduced a backward Fokker-Planck approach that enabled us to study analytically the statistics of the 
total number of particles n(t) in the tree at large time t. We have shown that at large i, while the mean number 
of particles grows as a power law in time, M(t) ~ At a with a = ln(m)/ln(c) for all c > 1, the variance V(t) of 
the number of particles has two different behaviors depending on the value of the parameter a. While for a < 2 
V(t) ~ M(t) for large t, for a > 2 the variance grows anomalously quickly: V(t) ~ [M \t)] 2 ~ 2 ''' ' a . We have identified 
the mathematical mechanism behind this phase transition at the critical value a c — 2 and shown that it is qualitatively 
similar to the phase transitions recently encountered in a search tree problem and also in a related fragmentation 
problem. Essentially, for a < 2, the typical value of n(t) grows in the same way as the average and the distribution 
is asymptotically normal whereas for a > 2, the typical value does not grow the same way as the average and the 
distribution is characterized by large fluctuations caused by the faster growth of a single branch of the tree. 

We obtained detailed analytical results for the first two moments of the number of particles for generic values of the 
parameter a. However, we were able to calculate the full asymptotic distribution of the number of particles only for 
two specific values of a, namely for a — 1 (c — m) and a — > oo (c = 1). Fortunately these two representative values, 
where an exact solution is possible, fall respectively on either side of the critical point a c — 2. Our exact solution 
shows that for a = 1 (< a c = 2) the distribution P(n,t) is Poisson and hence is asymptotically normal for large n. 
On the other hand for a — ► oo, the asymptotic distribution is certainly non-Gaussian, P(n,t) ~ vT^ exp[— ne"' m_1 ' f ] 
where the exponent (f> = (m — 2)/(m — 1) depends on m. The calculation of the distribution for other values of a 
remains a challenging problem. 

While we have studied this growth model on a tree because of its connections to the search tree problems as 
mentioned in the introduction, it is of general interest to study this growth problem on a regular Euclidean lattice, 
e.g. on a hyper-cubic lattice in d dimensions. In this lattice model, the cluster will grow similarly from a seed site 
at the origin. At a given instant, growth can occur at any of the available surface sites with a rate c~ r where r is 
the Euclidean distance of the surface site from the origin. One can then investigate the statistics of the total number 
of particles in the cluster after time t. It is easy to make a scaling argument for the growth of the mean number of 
particles M(t). Assuming that the cluster is compact with a typical radius R(t) at time t, we have M(t) ~ [R(t)] d . 
Also, the mean number of surface sites N p (t) ~ By the growth rule, dM/dt ~ N p (t)c~ R ^ for large t. This 

predicts R(t) ~ ln(i) for large t and hence the mean number of particles grows very slowly as M(t) ~ [ln(t)] d for 
large t. An interesting open question for future studies is whether, in finite dimensional lattice models, the variance 
exhibits a phase transition, similar to that seen on the tree, for some critical value of the parameter c? 
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